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Abstract 

We consider the problem of multiple change-point estimation in the mean of an AR(p) 
process. Taking into account the dependence structure does not allow us to use the inference 
approach of the independent case. Especially, the dynamic programming algorithm giving the 
optimal solution in the independent case cannot be used anymore. We propose a two-step 
method, based on the preliminary estimation of the autoregression parameters. It is based 
on robust statistics techniques, since our estimator has to be robust to the change-points if 
we do not want to estimate them before. Then, we propose to follow the classical inference 
approach, by plugging this estimator in the criterion used for change-point estimation, which 
is equivalent to decorrelate the series using the estimated autoregression parameters. We show 
that the asymptotic properties of these change-point location and mean estimators are the 
same as those of the classical estimators in the independent framework. The same plug-in 
approach is then used to approximate the modified BIC and choose the number of segments, 
and to derive a heuristic BIC criterion to select both the number of changes and the order 
of the autoregression. Finally, we show, in the simulation section, that for finite sample 
size taking into account the dependence structure improves the statistical performance of the 
change-point estimators and of the selection criterion. 


1 Introduction 


Change-point detection problems arise in many fields, such as genomics (Braun and Muller, 1998 


Braun et ah, 2000 Picard et ah, 2005), medical imaging (Lavielle, 2005), earth sciences (Williams, 


2003 Gazeaux et ah, 2013) or climate (Mestre, 2000 Lu et ah, 2010). In many of these problems, 


the observations cannot be assumed to be independent. 

In the literature, in the frequentist framework, there is two ways to deal with the dependence 
structure of series affected by multiple change-points: 


Apply the methodology of the independent case, and prove that asymptotic results are not 
affected by dependence under some conditions (Lavielle and Moulines, 2000 Lavielle, 1999). 


Consider that all the parameters of the model can change at each change-point (Bardet et ah. 


2012). In fact, inference is performed by considering also possible changes in the spectrum 


of the process. 
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The parameters of the dependence structure are here assumed to be global (i.e. not depending 
on the segments) nuisance parameters that have to be estimated to perform the change-points 
inference. 

In this paper, we consider the segmentation of an AR(p) process with homogeneous autore¬ 
gression coefficients: 


y^ = ^^k + Vi, tn,k + 1 < * < C,fe+i ,0<k<m*,l<i<n. 


( 1 ) 


where is a zero-mean stationary AR(p) process. That is, it is a stationary solution of 


Vi € Z,?7 j - 4'1'ni-r = 


( 2 ) 


where the e^’s are uncorrelated zero-mean rv’s with variance and d)* = {4'*j.)i<r<p ^ i® such that 
a stationary solution to § exists. In Q, we use the following conventions: tj) g = m*+i = 

Our aim is to propose a methodology for estimating both the change-point locations = 
{tnk)i<k<m* and the means /x* = {yl.)o<k<m*, accounting for the existence of d)*. Moreover, we 
propose a criterion to choose the number of change-points m*. 

In the sequel, we shall assume that there exists r* = ('r^)o<fc<m+i such that, for 0 < fc < m + 1 
Kik~ J) denoting the integer part of x. Consequently, Tq = 0 and = 1. 

A first idea is to use the maximum-likelihood approach to estimate the parameters of the model. 
However, maximizing the likelihood function, especially in the change-point location parameters 
T, leads to a complex discrete optimization problem in an algorithmic point of view. 

When the observations are independent, the optimal segmentation (e.g. in the maximum- 
likelihood sense) can be recovered via the dynamic programming (DP) algorithm introduced by 


Auger and Lawrence (19891. The computational complexity of this algorithm is quadratic relatively 


to the length of the series. This algorithm and some of its improvements (such as these proposed 


by Rigaill, 2010 or Killick et ah, 2012) are the only one that provide exactly the optimal change- 


point location estimators. However, the DP algorithm only applies when (i) the loss function (e.g. 
the negative log-likelihood) is additive with respect to the segments and when [ii) no parameter 
to be estimated is common to several segments. 

In the autoregressive case, the likelihood function violates both requirements (i) and (ii). 
Indeed the log-likelihood is not additive with respect to the segments because of the dependence 
that exists between data from neighbor segments and the unknown coefficients needs to be 
estimated jointly over all segments. Even if <I>* was known, the DP principle would not apply to 
the log-likelihood of Model 0 as it will still not be additive. We introduce an alternative criterion, 


based on the quasi-likelihood described by Bardet et al. (2012). This criterion is equivalent to the 


classic least-squares applied to a decorrelated version of the series, computed with an estimated 
To achieve this decorrelation, we shall provide an estimator of <&*. 

We shall prove that, under mild asymptotic assumptions on the estimator of d>*, the resulting 
change-point estimators satisfy the same rate of convergence as those proposed by Lavielle and 


Moulines (2000); Bardet et al. (2012). 


We show that such an estimator of <I>* exists and can be computed before segmenting the 
series. In order to estimate <&*, we first differentiate the series of observations (i/i) and work on 
Xi = Ui- Ui-i which satisfies 


Xi = except for i = ^ + 1, where a:** = {nl - /i^.i) + t-** ^+i, 0 < fc < 


m 


(3) 
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where Vi = rji - 77^.1 is an ARMA(p,l) defined from {rji) by 


p 

4’r^i-r 

r=l 


ei-1 


(4) 


To this aim, we borrow techniques from robust estimation (Ma and Genton, 20001. Briefly speaking, 
we consider the data observed at the change-point locations as outliers and propose an estimator 
of <&* that is robust to the presence of such outliers. We shall prove that the estimator that 
we propose is consistent and asymptotically Gaussian. Moreover, we propose a model selection 
criterion on the number of changes, the order of the autoregression being fixed, inspired by the one 


proposed by Zhang and Siegmund (2007) and prove some asymptotic properties of this criterion. 


Finally, we discuss the problem of selecting jointly the number of changes and the order of the 
autoregression and propose a practical solution to this problem based on a Bayesian heuristic. 


2 Robust estimation of the autoregression coefficients 

The aim of this section is to provide an estimator of <i>* which can deal with the presence of 
change-points in the data. In the absence of change-points {m* = 0 in ([^), the estimation of d)* 


is a well-known issue (see Brockwell and Davis, 1991, for a comprehensive introduction) and a 
consistent, asymptotically Gaussian estimator is given by the Yule-Walker method. We aim to 
adapt this method to our framework. 

Since change-points can be seen as outliers in the AR(p) process, we shall propose a robust 
approach for estimating <1>*. Our approach is based on the estimator of the autocorrelation function 


of a stationary time series proposed by Ma and Genton (2000) which is based on the robust scale 


'(v) 

estimator of Rousseeuw and Croux (1993). More precisely, let us define by 


where for i < j integers. 


is an estimator of p^.j defined by 


^n,pPn,2:{p+l) ^ 


Pn,v.j i<h<j ’ 


Pi-.j = {P{h))i<h<j 


(5) 


( 6 ) 


(7) 


where p{h) denotes the autocorrelation of the process (xi) defined in ^ at lag h, and state for 
the transpose. In ([^, i?„_p denotes the following matrix 


which is an estimator of 


Moreover, for all h in Z, 


Rn,p - {Pn{j i ^))l<i,j<p 

Rp = (p(J-*-l))l<^,J<p• 


( 8 ) 


(9) 


( 10 ) 


where a;)) = {xi+h+Xi)o<i<n-h, = (xi+h-Xi)o<i<n-h and is the scale estimator of Rousseeuw and 
Croux (1993) which is such that Qn {z) is proportional to the first quartile of {l^i - Zj\;0 < i < j < n}. 
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Proposition 2.1. Let and Pi,^ = 

Under the assumption that Ui in @ is Gaussian, we have 


be defined m ([^, Q and (10). 


{pnMP^u - php^d) „-z:; ^ (O’ 


( 11 ) 


in distribution, where M {Q,V) is the {p + 1)- dimensional centered Gaussian distribution with 
covariance matrix V and 


VR-^m) , ( 12 ) 

in distribution, where 




l<i<p,l<j<p+l 


(13) 


The proof of Proposition |2.1| is given in Section 


Remark 2.1. Proposition 2.1 argues for the possibility to estimate robustly <!)*. Equations (12) 


and (13) show that the variance of depends on <&*. This variance may be large for some <&*. 
One can deal with this problem by preferring a regularized version of this estimator, that is 

(,^n,p^n,p Sn,p^ ^n,pPn,2:{p+l) J (H) 

where Sn,p is a. p x p positive semi-definite symmetric real matrix. If Sn,p = then 


Proposition |2.1| remains true with being replaced by the expression of Equation (|14[). 


3 Change-points and expectations estimation 


In this section, the number of change-points m* is assumed to be known. In the sequel, for 
notational simplicity, m* will be denoted by m. Our goal is to estimate both the change-points 
and the means in Model 0 - A first idea consists in maximizing the Gaussian quasi-likelihood 
conditioned on uq, ..., j/i_p and to minimize it with respect to <I>. Due to a quadratic term that 
involves both 5k-i and Sk, this criterion cannot be efficiently minimized. Therefore, we propose to 
use an alternative criterion defined as follows: 

m tk+i / p 

SSm{y,^,s,t) = Y, Y (yi-j^4‘ryi-r-Sk 

k=0 i=tk + l \ r=l 

Note that minimizing SSm (1 - Zr=i 4’r) is equivalent to maximizing the Gaussian log- 
likelihood, conditioned on zq,. .. ,zi-p, of the following model maximized with respect to cr: 



= Yfirizi-r-pD + a , tl,,+ l<i<tl,,,^^ , 0<k<m , l<i<‘ 


(15) 


r=l 


where the e^’s are defined as in Model ([^. In this model, as in the models considered in Bardet 
et al. (2012), the expectation changes are not abrupt anymore as in Model ([^. 


Proposition 3.1. Let (4>„) be a sequence of rv’s on and z = (zi-p,... ,Zn) a finite sequence 
of real-valued rv’s satisfying ([T^. Let 5n{z,t^n) and t„(z,<I>„) he defined by 


(^^n(z, ^n): ^n(z, 4^n)) 

Tn(z,^„) 


argmin S'S'™ (z, d, t) , 

^n(,Z, j 

n 


(16) 

(17) 
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where 


^n,m ~ {(^0; • ■ ■ 5 ^m+1) <*•*< ^fc-1 — ^n} (1^) 

and where (A„) is a real sequence such that n““A„ — >• oo, as n ->■ oo and a > 0. Assume that 

(¥„-$*) = Op (n-i/2), (19) 

as n tends to infinity. Then, 

||t„(z,$„)-x*|| =Op(n-i), ||^„(z,$„)-^*ll =Op(n-'/2), 

where || • || is the Euclidian norm. 


Proposition 3.2. The result of Proposition \3A\ still holds under the same assumptions when z is 
replaced with y satisfying 0- 


The proofs of Propositions [3T] and 3.2 are given in Section 6.2 and |6.3[ respectively. Note that 
the estimators defined in these propositions have the same asymptotic properties as those of the 


estimators proposed by Lavielle and Moulines (2000). Also, the estimator defined in Section 
satisfies the same properties as and can thus be used in the criterion SSm for providing 
consistent estimators of the change-points and of the means. 


4 Selecting the number of change-points 

4.1 Criterion to select the number of change-points, the order of the 
autoregression being fixed 

In this section, we propose to adapt the modified Bayesian information criterion (mBIC, Zhang 


and Siegmund, 2007) to our autoregressive noise framework. mBIC was proposed to select the 


number m of change-points in the mean in the particular case of segmentation of an independent 
Gaussian process x. This criterion is derived from an Op(l) approximation of the Bayes factor 
between models with m and 0 change-points, respectively. Its performance for model selection is 
assessed by simulation studies (Zhang, 2005[ Frick et ah, 2014). 

The mBIC selection procedure consists in choosing the number of change-points as: 


m = argmaxCm(a:, 0) , 

m 

where the criterion Cjn{y,^) is defined for a process y as 


( 20 ) 




^ )log>S'>S'm(y,4>)+logr^ ” 7^^ ) ~ \ f^^ognk{t{y,<t>)) - mlogn. (21) 


In the latter equation 


SSjniy,^) = mmSSm{y,^,S,t), 


s,t 


( 22 ) 


where the minimization with respect to t is performed in An,m defined in (18), and 

nkitly,^)) = 4 + 1 ( 2 /, 4>) -4(y,4>), 
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where T( 2 /,$) = ,Tm(y, $)) is defined as *"(?/,$) =argminmin SSm{y,^,S,t). 

Note that, in Model (15), the criterion could be directly applied to the decorrelated series 
V* = «)i<i<n = (y^ - <^;yi-r)i<i<„ since 

C„(y,$*) = C™(u*,0). 

We propose to use the same selection criterion, replacing d)* by some relevant estimator <I)„. 
The following two propositions show that this plug-in approach results in the same asymptotic 
properties under both Models (15) and ([^. 


Proposition 4.1. For any positive m, for a process z satisfying (15) and under the assumptions 
of Proposition \3.1\ we have 

Cm{z,^„) = +Op{l), as n-* oo . 

Proposition 4.2. For any positive m, for a process y satisfying 0 and under the assumptions 
of Proposition \3.^ , we have 

Cm{y,^n) = Cm{y,^*) + Op{l), asn^oo. 


The proofs of Propositions 4.1 and 4.2 are similar to Chakar et al. (2014 Propositions 6 and 

7). 

In practice, we propose to take <I)„ = as defined in Sectionwhich satisfies the conditions 
of Proposition |4.2| to estimate the number of segments by 


rh = arg max Cm{y, ) 


0<m<?71n 


(23) 


where Cm is defined in (21), and for a given maximum number of changes m„ 


4.2 Heuristic to select both the number of changes and the autoregres¬ 
sion order 


Applying Zhang (2005 Theorem 2.2) to the series v* = (w*)j 




l<i<n 


, for a 


process z satisfying (15) and with the corresponding priors, gives: 


that is 


logP(TO|u*) = Cm(v*,0) - Co(v*, 0) + logP(m = 0|u*) + Op(l) , 
P(to|z,p,4>) = Cm(z,^) - Co(v*,0) + logP(m = Ojv*) +Op(l) , 


From Schwarz (1978), we approximate then logP(TO,p|z), up to a constant, by 

logP^r 


f log” 


and then, up to a constant, by 


Cm (z,Pj^j{')-^ log n 


where is the maximum likelihood estimator of 4>* in the model with m changes and autore¬ 
gression order p. Replacing by and z hy y satisfying ([^, we propose to select m and p 
by 

{m',p)= argmax jc^ (y, $£^1) - ^ log n| , (24) 

0<771<?71rnax ^ ^ ) 
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with given mmax and Pmax- Even if we do not aim to estimate the order p of the autoregression, 
this criterion is interesting by being more flexible than (23). Indeed, if at the true order p, 
provides a poor estimate of d)*, a different order (e.g. p + 1) can lead to a better fitting of the 
model and then a better estimate of the number of changes. 


5 Numerical experiments 

5.1 Practical implementation 

Our decorrelation procedure introduces spurious change-points in the series, at distance < p of the 
true change-points. When the length of the series tends to infinity, the effect of these artefacts 
on estimates vanishes, but with a finite length series our procedure may be affected. To fix this 
point, we propose a post-processing to the estimated change-points tn, which consists in removing 
segments of length smaller than p: 

pp (r„,p) = 

{tn,k 6 tn] '' {tn,i SUch that 3j,tn,i-p< t^J < t^^i and (j = 1 or tnj-l < tn,j -p)} ■ 


5.2 Simulation design 

We simulated Gaussian AR{p) series yi,...,yn of length n = 7200 and n = 14400, with p = 2 
or p = 5. Additionally, the observations y_i 9 ,..., j/Oj are simulated and used for conditioning 
the quasi-likelihood. All series were affected by m* = 6 change-points located at fractions 1/6 ± 
1/36,3/6 ± 2/36,5/6 ± 3/36 of their length. The mean within each segment alternates between 0 
and 1, starting with pi = 0. We considered autoregression parameters that verify the assumptions 


to get a stationary causal process, see Brockwell and Davis (1991 Theorem 3.1.1). We focused 
our attention on some parameters for which the computed estimators have a typical behavior and 
are interesting to illustrate our method. 

For p = 2 the parameters are the following 

(</.J,</)*, a*) € {(-1.2, -0.4,0.4), (1.6, -0.8,0.4), (0.2,0.2,0.4), (0.2,0.6,0.4), (0.4,0.2,0.2)} . 

For p = 5 the parameters are the following 

a*) €{(0.5,0,0,0.5,-0.5,0.4),(0.5,0,0,0,-0.5,0.4)} . 

Each combination was replicated S = 100 times. 


5.3 Quality criteria 

To assess the quality of the estimation of the autoregression parameters, we used the Root-Mean- 
Square Errors (RMSE) of defined in ([^. 

To study the performances of our proposed model selection criteria, we computed and com¬ 
pared: 

• mP = argmaxC'm(p, 0), the criterion Cm without any decorrelation procedure. 
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• my, the estimated number of changes derived from the BIC-type penalized criterion defined 
by Yao p^88t, 


• fh* = argmaxCmC?/, d>*), the criterion Cm where the series is exactly decorrelated, 

m 

• m*pp, the post-processed number of changes, that is the number of changes of PP{tn,p) if 
tn is the vector of the estimated change-point locations obtained by the minimization ( p^ 
with <& = <&*, 

• fh = argmaxC'm(y,‘hrf^), and the result of post-processing fhpp, 

m 

• {fh\p') defined in p4| . The post-processing, giving m'pp, is performed with PP{-,p), 
where Cm is defined in ( |21[ ). We were particularly interested in the comparison of: 

• rhy and fh^ to the other estimates to illustrate how much our method improves the estimation 
of the number of changes, 

• fh* and fh*pp are compared to fh and fhpp to identify the errors coming from the estimation 
of by 


• the post-processed estimates are compared to the non post-processed estimates to assess the 
usefulness of this finite-sample correction of our method. 

In order to measure the performance of change-point location estimators, we plotted their 
frequencies. In particular we are interested in the following change-point estimates: 

• the minimizer on An,m° of minS'S'ffiO {y,0,Sp)- 

’ s 

• tn,pp = PP {tn,p), where t„ minimizes mmS'S'® (y, on An,fn- 


• ^n,pp = PP^niP')^ where minimizes mmS'S'm' on An,m'- 

To highlight the peaks, we plotted also the Gaussian kernel density estimate. The bandwith is 
selected following the method of Sheather and Jones (|1991|). 


5.4 Results 


For p=2 the simulation results suggest that the decorrelation procedure is not necessary in all 
cases. If (j)\ and ^3 negative, = 6 almost all the times (see Table and Figures 
and . If only one of the parameters is negative, problems can arise without decorrelation 
if the other parameter is positively large (see Table and Figures and |^. The core of the 
problems with fhP is located at (j)*[ > these cases, our decorrelation procedure 

is required (see Tables and and Figures[|§0§§ and |10[ ) . fh can provide poor 
estimates of m because of a poor preliminary estimate of d>*. However, in this case, fh' can 


provide better estimates, thanks to an overestimation ofp by as noticed in Section 4.2 (see 
Tables and Figures [7j . 

Tables 1^ and illustrate the usefulness of post-processing. 
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For p = 5 we can see from Tables and that our method, when p is known, has the same 
performance as the methodology which would have access to the autoregression parameters 
<!>*, see the lines m and m*. These performances are not altered by the choice of p, even 
if it is overestimated by our model selection approach, see the lines m' and p'. Moreover, 
our method outperforms a methodology which would ignore the dependence structure of the 
process, see the line of these tables. In addition, our method is not only able to select 
the true number of change-points, whatever p, but also the true change-point positions, as 
displayed in Figures [T^ and [14} 


6 Proofs 

6.1 Proof of Proposition [2TT] 

Since there is only a finite number of atypical values in the process (xi), Theorem 4 of Levy-Leduc 


et al. (20111 still holds and gives that for all fixed h>l: 


n-h 


V 4 / Vn- ft i=i 

where 7 denotes the auto covariance of (t'i) and where for all x and y, 


tjj : {x,y) ^ 

I (7(0) +7(^)) IF 


x + y 


,Q,4> -(7(0)-7(/*))IF 


x-y 


^2(7(0)-7(/i)) 




\/2(7(0) +7(^)) 

In this equation IF is defined by 

IF(x Q 4>) = eld) { I/4-^(^+lM<i>))+<i>(^-l/c(4>)) 

^ ^ ^ H 4</>(2/)<^(2/+l/c(4>))dp 

where c(4>) = l/(\/2<I>“^(5/8)) « 2.21914, and 4> is here the cumulative distribution function of 
the standard normal distribution. By Theorem 2 of Levy-Leduc et al. (2011), we obtain that 

+ Ql{xl) 


- 7 ( 0 ) = Op(l) . 


Let 7 ( 0 ) = {Ql (xl) + Ql {Xf^))/4 then 


n-h 

Vn-h{'p{h) - p{h)) = -^ ^ ip{vi,Vi+h) + op(l) , 

\/n-h i=i 

In order to prove a central limit theorem for Pi;(p+i), it is enough to prove by the Cramer-Wold 
device (Billingsley, 1995 Theorem 29.4), that for any in M: 

viol"! "-(p+i)p+i 

i=l k=l 


converges in distribution to a centered Gaussian rv. By Lemma 13 of Levy-Leduc et al. (2011), 
■0 is of Hermite rank 2. Hence, the Hermite rank of the linear combination of the 0 function is 


larger than 2. Thus, Condition (2.40) of Theorem 4 in Arcones (1994) is satisfied and the quantity 
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n ^'i+fc) converges in distribution to a centered Gaussian rv with variance 

given by 





\ 


E 

1 ^ afcV^(i/i,^fc+i) 1 

+ 2^E 
£>1 

1 ^ afci/;(i/i,i/fe+i) 1 

1 XI ak^{^e+i,i'k+e+i)j 


By Slutsky’s lemma, \/n- h {'p{h) - p{h)) converges in distribution to a centered Gaussian rv 
with variance 7 ( 0 ) 0 ^^. Since {xi -Ecci) is an ARMA(p,l) process, with autoregressive parameters 
4>i, - ■ ■ ,(t>p, we get, by Azencott and Dacunha-Gastelle (1986, Chapter 11, Paragraph 2), that Rp, 
as defined in , is invertible, and 

where P 2 :(p+i) is defined in 0. 

Let g ; 2? c MP+i ^ W defined by 


d>* = R 


1 


'p P2:(p+1) I 


5 (^) “ (^ 2 ) • ■ ■ j Up+i) 


l<i,j<p 


(25) 


Hence, 


= 9{Pi-.{p+i)) , 

= g{Pnl:(p+l)) ■ 


By ([TT]) and the delta method: 

■A/'( 0 ,Vg(pi,(p+i))^RVg(pi,(p+i))) , 


Vg(pi;(p+i)) being the Jacobian matrix of g in Pi;(p+i) ■ Let us determine Vg(u) • h for all u e H, /i e 
Using d^ , we get 

(^|j-i-l|lj-i-l*o)]^g^jgp5('^) + + ■*-l-*-l=o) i<i j<p ^ = (^ 2 ) ■ • ■ > ^p+l) 

Applied to u = Pi:(p+i), we have 


p-i-1 


2 ?pVg (pi;(p+i)) •/i - (/ij+i ^ ^ 4>i+j+i^j 


1=1 


1=1 


i<l<p 


and then Vg(pi;(p+i)) = Rp^M, where M is defined in (13). 


6.2 Proof of Proposition [3TT] 


In the sequel, we need the following definitions, notations and remarks. Observe that (15) can be 
rewritten as follows: 

JL 

(26) 


z=Y€B''z + T{t:,)5* + 

r=l 


where 


.1 ^ 


\ Zn f 


B'^Z = 


^ Zl-r ^ 


^ ^n-r j 


= 


/ A? \ 


\ C / 


\ 


\ Cn / 


(27) 


where <5^ = (1 - T,r=i 'Pr)pk’ for 0 < /c < m, and T (t) is an n x (m+ 1) matrix where the fcth column 
is ((b^G^(b^)^. 

ik-l ik-ik-1 
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( 28 ) 


Let us define the exact and estimated decorrelated series by 

w* = 

r=l 

W = Z - Z ■ (29) 

r=l 

where 

For any vector subspace E of M", let tte denote the orthogonal projection of K" on E. Let 
also II • II be the Euclidean norm on K", (•, •) the canonical scalar product on M” and || • ||oo the sup 
norm. 

For X a vector of K" and t € An,m, let 


Jn,m {x,t) = - (llTTiS^* (x) f - UE, (x) f) , 


(30) 


written Jn(x,t) in the sequel for notational simplicity. In (30), and Et correspond to the 
linear subspaces of M" generated by the columns of T (t^) and T{t), respectively. We shall use 

(31) 


the same decomposition as the one introduced in Lavielle and Moulines (2000): 

Jn (^Xj t) — (x, i) + Vji (x, i) + Wfi (^Xf t) , 


where 


Kn {X,t) 

1 II 

(ttp -TTpJEx 


n 1 ' 


Vn (X,<) 

= -( 

IIttp (x-Ex)|| 


n \ 

II *n ' II 

Wn {X,t) 

^ -( 

(x-Ex) 


n ' 


We shall also use 

the following notations: 


IFB, 


(x-Ex)f j , 


A = 

min |(5/.-(5/_i| , 

l<k<m 

A = 

max |(5/;-<5/_i| , 

l<k<m 

A,* = 

min {rf - Tf_^) , 

l<k<m+l 

r 

{t e An,m] aA”^ < ||i - t* || < } , 

C' 

'^Q;,7,n 

l^Q,7,n ^ ^ An,rm ^k — 1, . . . , TTl, tk ^ 7./c 




(32) 

(33) 

(34) 

(35) 

(36) 


Vfc 6 1, aX ^ <tk- < ujAt* and 'ik il,tk - < ctA , (37) 

for any a > 0, 0 < 7 < 1/2 and I c { 1 ,..., m}. We shall also need the following lemmas in order to 
prove Proposition |3.1| which are proved below. 

Lemma 6.1. Let {zi-p,..., Zn) be defined by 0 or (HU, then, for all r = 0,... ,p: 

\\B^z\\=Op{n^/^) , 

as n tends to infinity, where B^z is defined in 

Lemma 6.2. Let {zi-p ,..., Zn) be defined by ([^ or ( [TU then, for all t 6 An,m, 

\Jn{w,t) - Jn{w* ,t)\<-Y, |</>* - 0r,n| W z\\ {p\cl)l W z\\ +2 ||u;*||) = Op , 


where Jn is defined in (30), Bz and z are defined in (27), w* is defined in (28) and w is defined 


in (29). 
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Lemma 6.3. Under the assumptions of Proposition 3.1 ||t„-t*||oo converges in probability to 
0, as n tends to infinity. 


Lemma 6.4. Under the assumptions of Proposition 3.1 and for any a > 0, 0 < 7 < 1/2 and 
Ic 


P I min 


Q/Cn {W*,t) + Vn {W* ,t) + Wn < 0 j -> 0 , 


as n oo , 


where C' „ (P) is defined in (37) and w* is defined in (28). 


Lemma 6.5. Under the assumptions of Proposition 3.1 and for any a>0, 0 < 7 <l /2 and 
Ic 

P I min J„ (wJ, t) < 0 I —> 0 , as n ^ oo ^ 


where C' „ (I) is defined in (37) and w is defined in (29). 


Lemma 6 . 6 . Under the assumptions of Proposition \P1\ 

||t„( 2,4>„)-T*||^ = Op(n“^) . 


Proof of Lemma \6U\ Without loss of generality, assume (zi-p,..., Zn) is defined by (15). = 

then Markov inequality implies that = Op{n). □ 


Proof of Lemma \6f^ By (28), w = w* + Er=i('/’r “ We get 


Iki5t(w)f - \\nEAw*)f = 


P _ 

TTeAw*) + - (t>r,n)T^EAB''Z) 

r=l 
2 


- Ik£;t(w*)ll 


'Z(^r-^r.n)EEAB''z) 


P 


+ 2 {eeAw*),EeAB'"z)) 


r=l 
P 


^ T. P(^r - hEtiB^'zfl + 2 ^( 0 / {tTe,(w*),TTe,(B''z)) 


r=l 

P 


^ Z(</>r - k.n) {P(fr “ IkiJt II ^ + 2 (tTe, (w* ) , TTp, ) ) 

r=l 

^ T.(^r - ^r,n) {^EtiB^Z),p{(j)* -^^^„)tTe,{B^Z) + 2tTe,(w*)) . 
r=l 

The Cauchy-Schwarz inequality and the 1-Lipschitz property of projections give 


- lkBt(w*)ll ^ E -k.nl Wzll (p|k -k,n| II II + 2 k* ||) 

r=l 


The conclusion follows from (30), (19) and Lemma 6.1 


□ 


Proof of Lemma [Oj Lavielle and Moulines ( 2000[ proof of Theorem 3) give the following bounds 
for any t e An,m.. 


Kn 


Vn 


\Wn 


> Emin(- max |4 - k-r*) J 

\ 71 l<k<m ’ / 

i=l / 


> - 


nAr, 


max 

l<s<n 


+ max 

l<s<n 


^'‘)T 

i=n-s / / 


3 (m + 1)^ A 


max 

l<s<n 


2=1 


+ max 

l<s<n 


E 

i=n-s 


(38) 

(39) 

(40) 
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where A, A anf At-* are defined in (32 -34). For any a > 0, define, as in the proof of Theorem 3 of 


Lavielle and Moulines (2000), 


^n,a ~ ^ ||^ II oo — ■ 


(41) 


For 0 < a < A^*, we have: 


P(||r„(z,<i)„)> na) < p| inin Jtj (w,t) < oj 

'f'" 


< Pl rnin {Jn{w,t) - Jn{w*,t)) <-aY 


+ p{ min {Vn{w* ,t) + Wn{w* ,t)) <-a^ 


’( mi 
\t6C„ 

’( mi 
\t6C„ 


< Pj min {Jn{w,t) - Jn{w*,t)) < -aX^ 


+ P 
+ P I max 

\l<s<n 


max y e* + max V e j 


> cX^nAna 


2 = 1 


+ max 

l<s<n 


E 


> cX^naX 


for some positive constant c. The last two terms of this sum go to 0 when n goes to infinity (Lavielle 
and Moulines, 2000 proof of Theorem 3). To show that the first term shares the same property, 
it suffices to show that J„ {w, t) - {w* , t) is bounded uniformly in t by a sequence of rv’s which 

converges to 0 in probability. This result holds by Lemma [ 6 . 2 1 □ 


Proof of Lemma I d. 4 Using Lavielle and Moulines (2000 Equations (64-66)), one can show the 
bound (73) of Lavielle and Moulines (2000) on 

p( min (Kniw*,t)+ Vniw*,t)+ Wniw*,t)) <o\. 

Using the same arguments, we have the same bound on 


PI min 


{w*, t) +14 (ui*,t) +114 (w*,t) j < oj. 


We conclude using Lavielle and Moulines (2000 Equations (67-71)). 


□ 


Proof of Lemma |d.5| By (31) 


PI min 


Jn iw,t) < oj < p|^ C™™(I) ~ < oj 

+ p( min i]-Kn{w* ,t) + Vn{w* ,t) + Wn{,w* ,t)\<{n . 


By Lemma [ 6 ^ the conclusion thus follows if 


PI min 
U6CL,,,„(2:) 


^ J„ {W,t) - Jn {W*,t) + ]^Kn (W*,t)j < oj -> 0 , 


as n ^ oo . 


Since min iF„ {w*,t) > (1 - 7 ) AT*a (Lavielle and Moulines, 


2000 


Equation (65)), 


mm 


{^Jn iw,t) - Jn + ^Kn (w* < oj 
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( mir 


< P( ^ ^pin (J„ {w,t) - J„ (w*,t)) < - (7 - 1) A 


x* , 


and we conclude by Lemma 6.2 


□ 


Proof of Lem7na \6.6[ For notational simplicity, tn{z^ will be replaced by in this proof. Since 
for any a > 0 , 

-P(||C -*nl|oo < = P(||t„ - t* 11^ < - p(i„ € Ca,j,n) , 


it is enough, by Lemma 6.3 to prove that 

F e > 0 , 


as n ^ 00 , 


for all a > 0 and 0 < 7 < 1 / 2 . Since Ca, 7 ,n = U Ca,-y,n n {t e An,m] V/c C k}^ shall 

only study one set in the union without loss of generality and prove that 


P 


{tn 6 Ca, 7 ,n) ^ 0 , aS 


where n is defined in (36 1 . Since n = U ^'a~t n 0-)i shall only study one set in 


the union without loss of generality and prove that 

P {tn 6 Ca,^,n O') ) ^ ^ ) 

Since 


as n ->• oo . 


P 


(C e 7 (I)) < 


the proof is complete by Lemma | 6 . 5 


□ 

Proof of Proposition \3.1\ For notational simplicity, Sn(z, <!>„) will be replaced by Sn in this proof. 
By Lemma [ 6 ^ the last result to show is 

ll^r.-V11= Op (n-i/2) , 

that is, for all k, Sn,k - ^ Op (281 and (29), 

^ ^7l,fc+l ^ 


■,k T 


E 


tnMl ^n,k 


By the Cauchy-Schwarz inequality. 


' (Pn,k+l Pn,k ) 


E W'i + E i^r - ^r,n) E 


in,k+l 

_E 

i=t~i u + 1 


— {fn,k+l ^n,/c) 


i=tn^k + l 


vl/2 




where the last equality comes from Lemma 6.1 Hence by (19) and Lemma 6.6 
4,fe = -p- —r E <+Op(n"i/2) 

n(r„,fe+i 

/ tn,k+l tn.k+l 


1 


POki^k+1 Pn,k) 


E + E e* 


, i=tn,fc + l 


j=tTi.fc + l 


+ Op(n-i/2), 


.+1 


where the last equality comes from (26) and (28). 
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Let us now prove that 


n{Tn,k+l i=t„,fc + l 


^ e, = Op(n-i/^) . 


(42) 


By Lemma 6.3 n ^ (rn^k+i - Tn,k) = Op{n ^). Moreover, 


1+1 


^ ■ 

i=t„,k + l ®=*^,fc + l i=tn,fe + l ®=‘^,fc+i 


(43) 


By the Central limit theorem, the first term in the right-hand side of (431 is Op{n^^^). By 
using the Cauchy-Schwarz inequality, we get that the second term of (431 satishes: | ^ 

K,k - in,k\^^^ {1,7=1 = Op{l)Op{n^/'^) = Op(n^/^), by Lemma 


6.6 


The same holds for the 


last term in the right-hand side of (43), which gives ( |42[ ). 
Hence, 


- Ofe 


1 




n,fc+l 1 ”\ 


fc,fc) 


^ {¥,w:-5l) + Op{n-^l^) 

E 


{¥,wl-5l) + Op{n-^l^) 


and then 

\Sn,k-Sl\ < 


—^-=-T tl {in,k + 1, ■ ■ ■ , tn,/c+l} \ {tl^k + 1> ■ ■ ■ : C,fe+l} , ^ax \S7 - Sl\ 

n{Tn^k+l-Tn,k) U=0,...,m 


+ Op{n-^/^) . 


We conclude by using Lemma 
and Lemma 


6.3 


6.6 


to get d {tn,k + 1, ■ ■ • , in.fe+l} ^ {C .fc + 1> ■ ■ ■ > t*uMl} = Op (1) 


to get (T„,fe+ 1 -T„,fc) =Op(l). 


□ 


6.3 Proof of Proposition |3.2| 

The connection between Models 0 and (dH) is made by the following lemmas. 
Lemma 6.7. Let {yo,---yn) be defined by 0 and let 

p 

~ Vi ~ Y, l^rl/i-l! 

r=l 

|-(Mfc-/ife-i)n=r-C */* = C.fc + ^ apdl<r<p 
lO, otherwise, 

where the ’s are defined in ([^, then the process 

w*i = v* + A* 


v7 

A* 


(44) 

(45) 


(46) 


equals Zi - Er=i ^i-r where (zi-p, 
structed recursively as 


verify (15). 


Such a process (zi-p,..., Zn) can be eon- 


\zi = i/i /or 1 - p < i < 0 

[Zi = w* +Z7=l(/^rZi-r for i> 0. 


(47) 
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Lemma 6.8. Let {yi-p,... ,yn) be defined by Q and let z be defined by {44~ 4''^b Then 


where 



Wi = v^ + Ai 

(48) 

Vi = 

P _ 

yi ~ XI ^r^nUi-r 

r=l 

(49) 

Wi = 

p _ 

r=l 

(50) 

A, = 

P _ 

XI ~ ^r,n^ (^i-r ~ Vi-r) ■ 

r=l 

(51) 

0 < 2 <n 

defined in {51). Then \\A\\ = Op {!). 



Proof of Lemma \6.'i\ Let z being defined by (47). Using (46), we get, for all 0 < fc < m, t* i < 
t* 


.,k+l 


(Zi - - E 4>r (Zi-r - Pk) = {Vi - /^fe) - E {Vi-r “ Tk) + K 

r=l r=l 


= {y^ - MD - E [vi-l - (MEllr•>^-t; j) 

r=l 

This expression equals (?/i - E (yj) - Er=i '/>r {Vi-r - E( 2 /i_r)) = T?i-Zr=i ‘Prli-r = Ci by ([^ and ([^. 
Then z satisfies (15). □ 

The proof of Lemma |6.8| is straightforward. 


Proof of Lemma \6.9[ (51) can be written as 

p 


A = A*+ ^ (B^y-B’-z) 


where A* = (A*)^^.^^, B^'y = (yi-r-)i<i<„ and B^z is defined in (27). By the triangle inequality, 


A||<||A*|| + f: „|(||i?’'y|| + ||i?’'z||). 

r=l 


(52) 


Since ||A*|| is constant in n it is bounded. The conclusion follows from (52), (19) and Lemma 

□ 


Proof of Proposition\3.^ Let y, z, v, w and A be defined in Lemma 6.8 


Using (30) and Lemma we get 

.X 2 

n 


Jn {v,t) = Jn {w,t) + Jn (A,t) - - (A)^ - (tTb^ (w) , 7r£)j (A)}j . 

By the Cauchy-Schwarz inequality and the 1-Lipschitz property of projections, we have 


|j„(A,i)| < 2||Af, 

(w),7r£;^* (A))-(7r£;, (uJ),7rE, (A)}| < 2||A||||?i;||. 
Note that w = z - fi^ .^B'^z thus by the triangle inequality 


NI<lkll+El<^..nll|B'’^ll 

r=l 
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Since |(/>r,nl = Op (1) for all 1 < r < p, we deduce from Lemma 6.1 that ||w|| = Op Since, 


by Lemma 6.9 ||A|| = Op (1), we obtain that 

sup|j„ (A,t) - ^ {w),nEt (A)))| = Op(n"^/^) . 

For 0 < a < A^-*, using @ and ( |4T| ), we get: 

> aj < p|^rnin J„(u, t)<o] 

< P I rnin {j„ {w,t) + Jn (A,i) 

" ^ (^)) - {w),^E, (A)))| < oj 

< P l^min [Kr, (uJ, t) + {w, t) + 1F„ (w, t) + J„ (A, t) 

~ \ ((^^< (^) (A)) - {tte, {w),tte, (A)))| < oj 

< ^^K„{w,t)+ Vn(w,t)+ Wniw,t)j <0^ 
+p|^min i^^Kn{w,t) + Jn{A,t) 

~ \ (^))- k^E,{ w),t^e, (a)))|< o). 

Following the proof of Lemma |6.3[ one can prove that 

p( min 1\k„ (w,t) + 14 (uJ,i) + W4 {w,t)\ < 0) —> 0 . 

\ t6C„,c. 12 ) J n^oo 


Using (p58|), we get that 
P 


|^A:„(w,t) + J„(A,t) - ^ {^))-{^Et iw),TTE^ (^)))} - 0 

^ ^’l^A^a + ^min |j„(A,i) - ^ (w) (A)|- (ttp, (u;),7r£;, (A)})| < oj 


(53) 


which goes to zero when n goes to infinity by (53). 


Then Lemma 6.3 still holds if y is defined hyM- To show the rate of convergence, we use the 
same decomposition. As in the proof of Lemma 6.6 PI min J„ (v, t) < 0 I —>• 0 for all a > 0 

and 0 < 7 < 1/2 is a sufficient condition for proving that P (^tn{y,Pn) ^Ca,-y,n} —0, which 
allows us to conclude on the rate of convergence of the estimated change-points. Note that 

p( min Jn(F, i)<o) < p( min (w, t) + 14 (u;, i) + 114 (w, t)| < 0 ) 

+ P ^ -A^a + J„ (a, t) 

(w) ,7r£;^. (A)| - (ttp, (ui) ,7ri5, (A)}) < oj . 


2 

n 


In the latter equation, the second term of the right-hand side goes to zero as n goes to infinity by 


(53). The first term of the right-hand side goes to zero when n goes to infinity by following the 


same line of reasoning as the one of Lemma |6.5[ This concludes the proof of Proposition |3.2| □ 
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7 Tables and figures 


n 

7200 

14400 

estimate \number of changes 

< 5 

5 
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7 
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3 
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0 
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0 

99 

1 

0 

0 

0 
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0 
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estimate \order of the autoregression 
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1 

2 

3 

> 3 

0 

1 

2 

3 

> 3 

P' 

0 

0 

96 

4 

0 

0 

0 

97 

1 

2 

RMSE 

1.99-10-2 

1.64-10-2 

RMSE 

1.80-10-2 

1.54-10-2 


Table 1: Estimates of the number of changes, of the order of the autoregression, and RMSEs of the 
estimates of the autoregression parameters, for 100 AR(2) series with the parameters = 

(-1.2,-0.4,0.4). 



Figure 1: Boxplots of the estimates of the number of changes for 100 AR(2) series with the 
parameters (</>*,o’*) = (-1.2,-0.4,0.4). n = 7200 (top) or 14400 (bottom). In each plot, the 
estimates boxplots are in the following order (from left to right): fhy, m, fhpp, fh*, m*pp, 
fh', fh'pp. The true number of changes is equal to 6 (red horizontal line). 
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Figure 2: Frequency plots of the change-point location estimates for 100 AR(2) series with the 
parameters = (-1.2,-0.4,0.4). n = 7200 (left) or 14400 (right). Estimates: (top), 

Tn^PP (middle), pp (bottom). The black line represents the absolute frequency of each location 
between 1 and n in estimates (scale on right axis). The red line represents the Gaussian kernel 
density estimate of this dataset (scale on left axis). 
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n 

7200 

14400 

estimate \number of changes 

< 5 

5 

6 

7 

> 7 

<5 

5 

6 

7 

> 7 

fhy 

0 

0 

0 

0 

100 

0 

0 

0 

0 

100 

fffi 

0 

0 

0 

0 

100 

0 

0 

0 

1 

99 

fh 

3 

0 

91 

6 

0 

0 

0 

99 

1 

0 

fhpp 

3 

0 

97 

0 

0 

0 

0 

100 

0 

0 

fh* 

0 

0 

94 

6 

0 

0 

0 

99 

1 

0 

Tfl pp 

0 

0 

100 

0 

0 

0 

0 

100 

0 

0 

fh' 

4 

0 

90 

6 

0 

0 

0 

98 

2 

0 

Tflpp 

4 

0 

96 

0 

0 

0 

0 

99 

1 

0 

estimate \order of the autoregression 

0 

1 

2 

3 

> 3 

0 

1 

2 

3 

> 3 

P' 

0 

0 

46 

24 

30 

0 

0 

55 

15 

30 

RMSE 

4.93-lO-^ 

3.46-10-2 

RMSE 

3.13-10-2 

2.16-10-2 


Table 2: Estimates of the number of changes, of the order of the autoregression, and RMSEs of the 
estimates of the autoregression parameters, for 100 AR(2) series with the parameters (/) 2 , cr*) = 
(1.6,-0.8,0.4). 



Figure 3: Boxplots of the estimates of the number of changes for 100 AR(2) series with the 
parameters = (1.6,-0.8,0.4). n = 7200 (top) or 14400 (bottom). In each plot, the 

estimates boxplots are in the following order (from left to right): rhy, m, mpp, fh*, fh*pp, 
fa', fh'pp. The true number of changes is equal to 6 (red horizontal line). 


20 
























































Figure 4: Frequency plots of the change-point location estimates for 100 AR(2) series with the 
parameters = (1-6,-0.8,0.4). n = 7200 (left) or 14400 (right). Estimates: (top), 

tn,pp (middle), pp (bottom). The black line represents the absolute frequency of each location 
between 1 and n in estimates (scale on right axis). The red line represents the Gaussian kernel 
density estimate of this dataset (scale on left axis). 
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n 

7200 

14400 

estimate \number of changes 

< 5 

5 

6 

7 

> 7 

<5 
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7 

> 7 

ffly 
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0 
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fnP 

0 

0 

51 

20 

29 

0 

0 

60 

17 

23 

m 

3 

0 

96 

3 

1 

0 

0 

98 

2 

0 

fhpp 

3 

0 

97 

2 

1 

0 

0 

98 

2 

0 

fh* 

0 

0 

97 

3 

0 

0 

0 

99 

1 

0 

TTl pp 

0 

0 

98 

2 

0 
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0 

99 

1 
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fh' 

0 

0 

97 

3 

0 

0 

0 

99 

1 

0 

TTl pp 

0 

0 

98 

2 

0 

0 

0 

99 

1 
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estimate \order of the autoregression 

0 

1 

2 

3 

> 3 

0 

1 

2 

3 

> 3 


0 

0 

59 

25 

16 

0 

0 

45 

30 

25 

RMSE 

7.00-10-2 

6.44-10-2 

RMSE 

4.20-10-2 

3.68-10-2 


Table 3: Estimates of the number of changes, of the order of the autoregression, and RMSEs of the 
estimates of the autoregression parameters, for 100 AR(2) series with the parameters (/) 2 , cr*) = 
(0.2,0.2,0.4). 



Figure 5: Boxplots of the estimates of the number of changes for 100 AR(2) series with the 
parameters (^ 27 o’*) = (0.2,0.2,0.4). n = 7200 (top) or 14400 (bottom). In each plot, the 
estimates boxplots are in the following order (from left to right): rhy, fhpp, fn*, fh*pp, 

fa', fh'pp. The true number of changes is equal to 6 (red horizontal line). 
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Figure 6: Frequency plots of the change-point location estimates for 100 AR(2) series with the 
parameters = (0.2,0.2,0.4). n = 7200 (left) or 14400 (right). Estimates: (top), 

tn,pp (middle), pp (bottom). The black line represents the absolute frequency of each location 
between 1 and n in estimates (scale on right axis). The red line represents the Gaussian kernel 
density estimate of this dataset (scale on left axis). 
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0 
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0 
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9 
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fhpp 

13 

0 

28 

3 

56 

25 

0 

33 

5 

37 

fh* 
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1 
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0 
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11 

1 

TTl pp 
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7 
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98 

2 

0 
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22 

0 
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1 
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0 

0 

18 
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70 

0 

0 

14 

12 

74 

RMSE 

3.44-10^1 

2.40-10-1 

RMSE 

2.41-10-1 

1.71-10-1 


Table 4: Estimates of the number of changes, of the order of the autoregression, and RMSEs of the 
estimates of the autoregression parameters, for 100 AR(2) series with the parameters (/) 2 , cr*) = 
(0.2,0.6,0.4). 



Figure 7: Boxplots of the estimates of the number of changes for 100 AR(2) series with the 
parameters ((> 2 ,cr*) = (0.2,0.6,0.4). n = 7200 (top) or 14400 (bottom). In each plot, the 
estimates boxplots are in the following order (from left to right): rhy, fhpp, fn*, fh*pp, 

fa', fh'pp. The true number of changes is equal to 6 (red horizontal line). 
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Figure 8: Frequency plots of the change-point location estimates for 100 AR(2) series with the 
parameters = (0.2,0.6,0.4). n = 7200 (left) or 14400 (right). Estimates: (top), 

tn,pp (middle), pp (bottom). The black line represents the absolute frequency of each location 
between 1 and n in estimates (scale on right axis). The red line represents the Gaussian kernel 
density estimate of this dataset (scale on left axis). 
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n 

7200 

14400 

estimate \number of changes 

< 5 

5 

6 

7 

> 7 

< 5 

5 

6 

7 

> 7 

rhy 

0 

0 

0 

0 

100 

0 

0 

0 

0 

100 

fhP 

0 

0 

0 

0 

100 

0 

0 

0 

0 

100 

fh 

2 

0 

52 

28 

18 

1 

0 

59 

28 

12 

rhpp 

3 

0 

85 

10 

2 

1 

0 

96 

3 

0 

fh* 

0 

0 

65 

28 

7 

0 

0 

70 

29 

1 

Tfl p p 

0 

0 

98 

2 

0 

0 

0 

98 

2 

0 

fh' 

0 

0 

63 

28 

9 

0 

0 

74 

24 

2 

TTl pp 

0 

0 

100 

0 

0 

0 

0 

99 

1 

0 

estimate \order of the autoregression 

0 

1 

2 

3 

> 3 

0 

1 

2 

3 

> 3 

f 

0 

0 

36 

20 

44 

0 

0 

36 

21 

43 

RMSE 

1.11-10^^ 

8.17-10-2 

RMSE 

5.16-lO'^ 

3.76-10-2 


Table 5: Estimates of the number of changes, of the order of the autoregression, and RMSEs of the 
estimates of the autoregression parameters, for 100 AR(2) series with the parameters (/) 2 , cr*) = 
(0.4,0.2,0.2). 



Figure 9: Boxplots of the estimates of the number of changes for 100 AR(2) series with the 
parameters ((> 2 ,cr*) = (0.4,0.2,0.2). n = 7200 (top) or 14400 (bottom). In each plot, the 
estimates boxplots are in the following order (from left to right): rhy, fhpp, fn*, fh*pp, 

fa', fh'pp. The true number of changes is equal to 6 (red horizontal line). 
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Figure 10: Frequency plots of the change-point location estimates for 100 AR(2) series with the 
parameters = (0.4,0.2,0.2). n = 7200 (left) or 14400 (right). Estimates: (top), 

tn,pp (middle), pp (bottom). The black line represents the absolute frequency of each location 
between 1 and n in estimates (scale on right axis). The red line represents the Gaussian kernel 
density estimate of this dataset (scale on left axis). 
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0 
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0 

0 
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0 

0 

0 

0 
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fh^ 

0 

0 

31 

16 

53 

0 

0 

31 

14 

55 

fh 

2 

0 

90 

4 

4 

0 

0 

100 

0 

0 

Tflpp 

2 

0 

92 

2 

4 

0 

0 

100 

0 

0 

fh* 

0 

0 

100 

0 

0 

0 

0 

99 

1 

0 

Tfl pp 

0 

0 

100 

0 

0 

0 

0 

100 

0 

0 

fh' 

0 

0 

99 
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0 

0 

0 

99 

1 

0 

Tflpp 

0 

0 

99 

1 

0 

0 

0 

100 

0 

0 

estimate \order of the autoregression 

<4 

4 

5 

6 

> 6 

<4 

4 

5 

6 

> 6 


0 

0 

41 

25 

34 

0 

0 

45 

21 

34 

RMSE 

1 . 01 - 10 '^ 

6.92-10-2 

RMSE 

4.36-10-2 

3.19-10-2 

RMSE 

3.54-10-2 

2.45-10-2 

RMSE 

2.48-10-2 

1.84-10-2 

RMSE 

3.72-10-2 

2.35-10-2 


Table 6 : Estimates of the number of changes, of the order of the autoregression, and RMSEs 
of the estimates of the autoregression parameters, for 100 AR(5) series with the parameters 
= (0.5,0,0,0.5,-0.5,0.4). 



Figure 11: Boxplots of the estimates of the number of changes for 100 AR(5) series with the 
parameters ()) 2 ,<(> 4 ,</> 5 ,cr*) = (0.5,0,0,0.5,-0.5,0.4). n = 7200 (top) or 14400 (bottom). In 
each plot, the estimates boxplots are in the following order (from left to right): m^, m°, m, fhpp, 
fh*, fh*pp, m', fh'pp. The true number of changes is equal to 6 (red horizontal line). 


28 



























































LJ. 

LsA^ 


- 










V. 

il- 









o 


o o o o o 

o o o o o 

C\J O ■«t 03 IM 

CO ^ m CD 



Figure 12: Frequency plots of the change-point location estimates for 100 AR(5) series with 
the parameters = (0.5,0,0,0.5,-0.5,0.4). n = 7200 (left) or 14400 (right). 

Estimates: (top), Tn^pp (middle), t'^pp (bottom). The black line represents the absolute 

frequency of each location between 1 and n in estimates (scale on right axis). The red line 
represents the Gaussian kernel density estimate of this dataset (scale on left axis). 
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0 
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0 
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0 

0 

0 

0 
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fh^ 

0 

0 

53 

5 

42 

0 

0 

54 

2 

44 

fh 

0 

0 
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0 

0 

0 

0 

100 

0 
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Tflpp 

0 

0 
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0 
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100 

0 

0 

fh* 
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0 
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0 

0 
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100 

0 
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0 
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0 
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estimate \order of the autoregression 

<4 

4 

5 

6 

> 6 

<4 

4 

5 

6 

> 6 


0 

0 

72 

17 

11 

0 

0 

83 

12 

5 

RMSE 

2.99-lO'^ 

1.77-10-2 

RMSE 

1.24-10-2 

1.05-10-2 

RMSE 

1.25-10-2 

1.03-10-2 

RMSE 

1.28-10-2 

1.01-10-2 

RMSE 

1.29-10-2 

9.47-10-3 


Table 7: Estimates of the number of changes, of the order of the autoregression, and RMSEs 
of the estimates of the autoregression parameters, for 100 AR(5) series with the parameters 
= (0.5,0,0,0,-0.5,0.4). 



Figure 13: Boxplots of the estimates of the number of changes for 100 AR(5) series with the 
parameters = (0.5,0,0,0,-0.5,0.4). n = 7200 (top) or 14400 (bottom). In 

each plot, the estimates boxplots are in the following order (from left to right): m^, fhP^ m, fhpp, 
fh*, fh*pp, m', fh'pp. The true number of changes is equal to 6 (red horizontal line). 


30 























































- 

1 




- 

-ixM 




u 

k - 



Figure 14: Frequency plots of the change-point location estimates for 100 AR(5) series with 
the parameters = (0.5,0,0,0,-0.5,0.4). n = 7200 (left) or 14400 (right). 

Estimates: (top), (middle), t'^pp (bottom). The black line represents the absolute 

frequency of each location between 1 and n in estimates (scale on right axis). The red line 
represents the Gaussian kernel density estimate of this dataset (scale on left axis). 
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